Ijraset Journal For Research in Applied Science and Engineering Technology
Authors: Jeevan Nair PK, Sudharsan V, Kiran K Iyer, Prof. Rosemary Varghese
DOI Link: https://doi.org/10.22214/ijraset.2023.53121
Certificate: View Certificate
Tweet Sentiment & Emotion Analysis using Bi-LSTM in RNN. Twitter has developed into a useful medium for sharing ideas, attitudes, and feelings. Applications like opinion mining, market research, and social trend analysis all depend on the sentiment and emotion of tweets. The Bi-LSTM architecture in RNN (Recurrent Neural Networks) is used in this study to present an advanced method for sentiment and emotion analysis on Twitter. By utilising machine learning techniques, the objective is to increase the precision and efficacy of mood and emotion analysis. The study emphasises both conventional text analysis and real-time data analysis. Organisations and governmental bodies may continuously monitor sentiment and emotional patterns on Twitter thanks to real-time analysis, which enables them to react quickly to emerging problems or crises. In a typical text analysis, historical tweet data is examined to learn more about user viewpoints, emotional patterns, and sentiment distributions. The Bi-LSTM architecture is used because it can effectively capture the context and sequential dependencies found in tweets. To ensure consistent analysis, the system gathers real-time tweets and conducts the preprocessing stages. Monitoring sentiment changes, emotional responses, and new trends on Twitter are all made possible by this study. The goal of the research is to improve the precision and efficacy of sentiment and emotion analysis on Twitter by utilising machine learning techniques, real-time data analysis, and standard text analysis. The findings and conclusions will aid in understanding public mood and feelings in the digital age.
I. INTRODUCTION
Twitter sentiment and emotion analysis is determining the sentiment and emotion of tweets using natural language processing (NLP) and machine learning techniques. Sentiment analysis is the process of recognising and categorising positive, negative, or neutral opinions conveyed in text. Emotion analysis, on the other hand, entails determining the underlying emotion underlying a tweet.
The Bi-directional Long Short-Term Memory (Bi-LSTM) model is a prominent machine learning technique for sentiment and emotion analysis. The Bi-LSTM model is a recurrent neural network (RNN) that can process sequential data like text.
The Bi-LSTM model is made up of many layers of LSTM cells. Each LSTM cell is in charge of remembering the prior state as well as determining the current state.
The Bi-LSTM model is referred to as "bi-directional" since it processes the input sequence both forward and backward. As a result, the model is able to capture the context of the full input sequence.
The Bi-LSTM model can be trained on a large dataset of labelled tweets in the context of Twitter sentiment and emotion research. Based on the sentiment conveyed, the labelled tweets can be classified as good, negative, or neutral. The dataset can also be labelled according to the underlying emotion expressed in the tweet, such as anger, joy, sadness, or fear.
Once trained, the Bi-LSTM model can be used to predict the sentiment and emotion of new tweets. The model takes a tweet's word sequence as input and generates a probability distribution across the possible sentiment or emotion categories.
The Bi-LSTM model modifies the weights of its neurons during training to minimise the error between anticipated and real sentiment or emotion labels. This is known as backpropagation, and it is an important part of neural network training.
Other machine learning methods, such as convolutional neural networks (CNNs) and support vector machines (SVMs), can be employed in addition to the Bi-LSTM model for Twitter sentiment and emotion analysis. However, the Bi-LSTM model has been shown to be particularly effective for analyzing sequential data such as text.
Finally, Twitter sentiment and emotion analysis using the Bi-LSTM model is an effective method for analysing the sentiment and emotion communicated in tweets. With the growing usage of social media and the massive volumes of data created by these platforms, sentiment and emotion analysis can provide significant insights into people's opinions and emotions on a variety of topics and concerns.
II. EXISTING SYSTEM
The current sentiment and emotion analysis system for Twitter makes use of conventional machine learning techniques including feature engineering and classification algorithms. To infer sentiment and mood from tweets, it may also use rule-based systems or lexicon-based techniques. The abundance of sarcasm, figurative language, and changing linguistic trends on Twitter are just a few of the problems these algorithms frequently run into when dealing with unstructured text data.
The lexicon-based strategy was a popular approach. These systems used sentiment lexicons created specifically for Twitter data. Lexicons contained annotated words and phrases with sentiment scores. Sentiment analysis algorithms would compute a tweet's overall sentiment by summing the sentiment scores of its constituent terms. While this method was straightforward and easy to use, it struggled with slang, neologisms, and the ever-changing nature of Twitter language.
Another method required the use of machine learning methods like Support Vector Machines (SVM) or Naive Bayes. These methods need feature engineering, which involves extracting multiple features from the text, including word frequencies, n-grams, part-of-speech tags, and syntactic patterns. These features were then used to train a model that could categorise tweets based on their emotion. However, these models frequently struggled to capture contextual information and nuances of Twitter language.
The usage of sentiment-specific features was a popular strategy for Twitter sentiment analysis. Positive and negative emoticons, prolonged words (e.g., "loooove"), repeated letters (e.g., "happyyyy"), and capitalization were among these characteristics. These features attempted to capture the sentiment portrayed in the text using specific patterns seen on Twitter. This technique, however, was significantly reliant on handcrafted features and lacked the ability to learn complex patterns automatically.
Prior to the introduction of RNNs, current algorithms for Twitter sentiment analysis struggled to capture the contextual information, sarcasm, irony, and other nuances inherent in Twitter data. RNNs revolutionised sentiment analysis by modelling long-term dependencies and capturing the context in which words and phrases arise, thanks to their capacity to capture sequential dependencies in text. This resulted in considerable increases in sentiment analysis performance, notably in capturing Twitter data's distinctive properties.
The following are some of the system's drawbacks:
III. OBJECTIVES
The Bi-LSTM model is used for Twitter sentiment and emotion analysis in order to accurately predict the sentiment and emotion represented in tweets. This technique has the ability to provide useful insights into people's opinions and emotions about a variety of issues and concerns.
The precise objectives and goals related with this analysis are as follows:
IV. PROPOSED SYSTEM
The following steps would be involved in the proposed system for Twitter sentiment and emotion analysis using Bi-LSTM in RNN:
Overall, the proposed Twitter sentiment and emotion analysis system based on Bi-LSTM in RNN can provide accurate and important insights into the sentiment and emotion represented in tweets. This can assist businesses, organisations, and individuals in making educated decisions based on public opinion and so improving their products and services.
To summarise, Twitter sentiment and emotion analysis using Bi-LSTM in RNN is a powerful and effective method for interpreting the sentiment and emotions represented in tweets. Data gathering, pre-processing, feature extraction, model training, evaluation, deployment, and visualisation are all stages of the proposed system. The method can provide accurate and important insights into public opinion and emotions on a wide range of topics and situations, allowing corporations, organisations, and people to make well-informed decisions based on popular opinion.
Sentiment140, DeepMoji, and LSTM-ER are three existing systems that have proved the usefulness of employing Bi-LSTM in RNN for Twitter sentiment and emotion analysis. These algorithms have demonstrated great accuracy in sentiment and emotion recognition and are widely employed in research and industry.
This degree of accuracy suggests that the sentiment and emotion analysis algorithm can consistently categorise the majority of tweets into their corresponding sentiment categories (positive, negative, neutral), as well as emotional labels.
The use of graphs and charts to visualise information has aided researchers and analysts in gaining a better understanding of the data. However, there are certain limitations to employing Bi-LSTM in RNN to analyse Twitter sentiment and emotion. One of the most difficult aspects is dealing with sarcasm and irony, which can be difficult to detect in text. Another difficulty is dealing with noisy and unclear material, such as misspellings and informal language, which can impair analysis accuracy.
Cultural and linguistic differences can impair the accuracy of sentiment and emotion analysis, because various languages and cultures may represent sentiment and emotion in different ways. Future research might concentrate on overcoming these obstacles and enhancing the accuracy and robustness of sentiment and emotion analysis on Twitter.
Furthermore, incorporating other methodologies, such as natural language processing (NLP) and machine learning (ML), can improve the analysis's accuracy and effectiveness.
Overall, Twitter sentiment and emotion analysis utilising Bi-LSTM in RNN has a high potential for delivering useful insights into public opinion and emotions, and it has the ability to have a substantial impact on a variety of sectors such as business, politics, and social sciences. It is a fast-evolving area, and future improvements are projected to improve its capabilities and usefulness even further.
IX. FUTURE SCOPE
[1] Chen, T., & Duan, Y. (2019). Twitter Sentiment Analysis using Bi-LSTM-CNN Model. Proceedings of the 2019 International Conference on Internet Computing and Big Data (ICBD), 85-89. DOI: 10.1145/3322645.3322676. [2] Zhang, H., & Wang, L. (2020). Emotion Analysis on Twitter Data using Bi-LSTM Network. Proceedings of the 2020 International Conference on Artificial Intelligence and Computer Science (ICAICS), 279-283. DOI: 10.1145/3429486.3429493. [3] Felbo, B., et al. (2017). Using Millions of Emoji Occurrences to Learn Any-domain Representations for Detecting Sentiment, Emotion, and Sarcasm. Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing (EMNLP), 1615-1625. DOI: 10.18653/v1/D17-1169. [4] Jindal, A., & Liu, B. (2008). Opinion Spam and Analysis. Proceedings of the International Conference on Web Search and Data Mining (WSDM), 219-230. DOI: 10.1145/1341531.1341557. [5] Kim, Y. (2014). Convolutional Neural Networks for Sentence Classification. Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), 1746-1751. DOI: 10.3115/v1/D14-1181. [6] Hochreiter, S., & Schmidhuber, J. (1997). Long Short-Term Memory. Neural Computation, 9(8), 1735-1780. DOI: 10.1162/neco.1997.9.8.1735. [7] Zhang, X., & LeCun, Y. (2015). Text Understanding from Scratch. Proceedings of the 2015 International Conference on Machine Learning (ICML), 1242-1250. [8] Alsmadi, I., et al. (2018). Sentiment Analysis of Twitter Data: A Comprehensive Review. International Journal of Computational Linguistics Research, 9(2), 69-82. [9] Pak, A., & Paroubek, P. (2010). Twitter as a Corpus for Sentiment Analysis and Opinion Mining. Proceedings of the 7th International Conference on Language Resources and Evaluation (LREC), 1320-1326. [10] Go, A., et al. (2009). Twitter Sentiment Classification using Distant Supervision. Proceedings of the 23rd International Conference on Computational Linguistics (COLING), 2, 2009, 401-408.
Copyright © 2023 Jeevan Nair PK, Sudharsan V, Kiran K Iyer, Prof. Rosemary Varghese. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Paper Id : IJRASET53121
Publish Date : 2023-05-27
ISSN : 2321-9653
Publisher Name : IJRASET
DOI Link : Click Here